PREFACE

This handbook is designed to acquaint foreign language departments
in colleges and universities in the United States with recent developments
in oral proficiency testing in foreign language programs. It is intended
to provide an overview of the rationale, development, procedures, and
implications of oral proficiency evaluation in the context of the teaching
mission of college and university foreign language departments. In
addition, it reports the findings of PROJECT OPT, a federally funded
study to determine the feasibility of incorporating oral proficiency
testing in college-level foreign language courses.

It is important to note that this handbook is not intended to be a
training manual but rather an introduction to the principles and
techniques of oral proficiency testing. A Reference section provides additional
information regarding the growing bibliography of printed sources and
the expanding network of college and university professors with
experience in oral proficiency evaluation. Chairpersons and professors
who would like further information are advised to contact David V.
Hiple, Project Director, American Council on the Teaching of Foreign
Languages, 579 Broadway, Hastings-on-Hudson, NY 10706; Judith E.
Liskin-Gasparro, Senior Examiner, Educational Testing Service,
Princeton, NJ 08541; or the authors of this handbook at the University
of New Hampshire, Durham, NH 03824. Training for faculty
members can be arranged through ACTFL on the home campus or is
available at periodic workshops held around the country.

I. ORAL PROFICIENCY TESTING: AN OVERVIEW
WHAT IS ORAL PROFICIENCY TESTING? 

Oral proficiency testing is an evaluation process designed to
determine how well a person can speak a foreign language. The process
consists of procedures that assess the ability of the speaker to use language
effectively in communicative situations. Originally developed by the
Foreign Service Institute of the U.S. Department of State in the 1950s, 
it has been adapted to academic settings in recent years in answer to 
a need expressed by many foreign language professors. 



WHY IS ORAL PROFICIENCY TESTING IMPORTANT IN
ACADEMIC SETTINGS?

Oral proficiency testing in academic settings is a recent development 
in foreign language teaching. It addresses a longstanding concern of 
college and university foreign language departments: the evaluation of 
oral skills. This concern is directly related to the teaching mission of 
foreign language faculties. The mission recognizes that the primary 
function of the department is to instruct majors in the foreign languages 
and literatures and that the secondary function is to provide language 
instruction for non-majors. 

Achieving departmental goals related to the teaching mission requires
coordination of curriculum, instruction, and evaluation. As recent
foreign language texts attest, course content and teaching techniques
emphasize the development of oral skills for communicative functions.
Evaluation of these oral skills, however, has not been systematically
planned and implemented. In contrast, techniques for evaluating
grammatical knowledge, vocabulary acquisition, and reading comprehension
are well-established components of the foreign language curriculum.
These techniques are primarily pencil-and-paper tests that can
be administered to groups of students and scored objectively. They
provide valuable data relative to mastery of the morphological,
syntactical, and lexical elements of the foreign language as well as the ability
to comprehend and produce the written language. They do not,
however, measure how well a student can communicate orally in the
foreign language.

The ability to communicate orally in the foreign language is a primary
goal of both students and faculties in foreign language departments.
Implicit in this goal is the recognition that mastery of a foreign language
benefits the individual and society in both humanistic and pragmatic
ways. Thus in the past two decades, many foreign language departments
have expanded their focus to include courses and concentrations
that complement the traditional literary orientation. Likewise, students
have recognized the ancillary value of language proficiency while
preparing for careers in a broad range of other fields. The ability to
communicate in a foreign language, even at limited proficiency levels, is
increasingly recognized as a valuable asset in such areas as business and
commerce, the service professions, travel and recreation, and the
government.

Professors who are aware of the important role of oral proficiency
have realized that this area must be evaluated with reliable and valid
testing techniques. Such techniques provide a common standard that
can be applied to all students, regardless of institution, level of study,
or type of curriculum. Furthermore, they describe proficiency in terms
that explain what students can actually say in the foreign language. This
kind of analysis provides valuable information to the professor, the
student, and prospective employers. Until recently, validated
techniques were not available for wider use in foreign language departments
in schools, colleges, and universities, although some individual
professors had developed their own procedures.

WHAT IS HAPPENING IN ORAL PROFICIENCY TESTING?

In 1979, President Carter's Commission on Foreign Language and
International Studies proposed, as part of a series of recommendations
to improve foreign language instruction in the United States, that
language proficiency goals and guidelines for evaluation be
established. The suggestion was a timely one for two reasons.

In the first place, those foreign language professors who had themselves 
learned a foreign language in government-sponsored agencies, such as 
the Defense Language Institute, the Foreign Service Institute, or the 
Peace Corps, were aware of specific procedures to rate oral proficiency.
These procedures, which have been refined over a period of 30 years,
are now used regularly by the approximately 30 government agencies
that constitute the Interagency Language Roundtable (ILR). They are
the primary means to determine the functional language ability of
candidates for the thousands of government positions that require a
knowledge of a foreign language.

In the second place, the Educational Testing Service (ETS), a private 
educational agency that develops evaluation procedures and in- 
struments, and the American Council on the Teaching of Foreign 
Languages (ACTFL), a professional organization of foreign language 
teachers, were already addressing the problem. In 1980, ETS, with
funding from the U.S. Department of Education, undertook a project
entitled "A Common Metric for Language Proficiency." The result
was an adaptation of the Interagency Language Roundtable proficiency
rating scale to reflect the needs and realities of the academic setting.
Subsequently ACTFL, in projects also funded by the U.S. Department
of Education, sponsored workshops to train foreign language teachers 
in the use of the new scale. It also produced a series of proficiency 
guidelines, written by teams of college and university language pro- 
fessors, in the five areas of listening, speaking, reading, writing, and 
culture. 

The response to the President's Commission has been a collaborative 
effort by ETS, ACTFL, and foreign language professors working in 
their own institutions to develop procedures, train testers, and
investigate the impact and implications of utilization of oral proficiency
tests. The most highly developed and commonly used procedure is the
ACTFL/ETS Oral Proficiency Interview and Rating Scale, which will
be discussed in Section II of this handbook.

Since 1982, the training of testers has taken place in one-day
familiarization workshops and four-day intensive workshops in several
parts of the country. Investigation of the impact and implications is
a continuing process that has ramifications for the development of
course and program goals, the design of instructional activities, and
the development of texts and other teaching materials. Centers for
research and training have been established, projects funded, and
networks formed to provide a sound basis for continued development and
implementation.

The successful development of techniques for oral proficiency testing
constitutes a major step toward realizing the goal of communicative
competence in foreign language learning and teaching. There are,
however, more far-reaching implications than simply the evaluation of
speaking skill. Research and development have been expanded to
include the other modalities of listening, reading, writing, and culture.
Consequently, proficiency as an organizing principle represents a vital
and powerful concept for fulfilling the teaching mission of foreign
language departments in the colleges and universities.

II. THE ACTFL/ETS ORAL PROFICIENCY
EVALUATION PROCESS

WHAT IS THE PROCESS? 

The ACTFL/ETS oral proficiency evaluation process is designed to 
provide a measure of functional language ability that describes how
well a person can use the foreign language in real-life settings, regardless
of how it has been learned or acquired. It is a modification, for academic
settings, of the Interagency Language Roundtable (ILR) interview pro- 
cess. The process consists of a conversational interview and the rating 
of the resulting speech sample. The interview, which provides the 
language sample to be evaluated, is a highly structured conversation
based on levels determined by the rating scale. Since an understanding
of the rating scale is fundamental to conducting a successful interview, 
the scale will be discussed first. 



WHAT IS THE ACTFL/ETS RATING SCALE? 

The ACTFL/ETS rating scale is a descriptive instrument that defines
levels of real-life language proficiency. It is a modification of the ILR
scale, which was initially developed by the Foreign Service Institute
(FSI). Figure I presents a comparison of the two scales.

The central cone of the figure represents the speaker's functional
language ability, as demonstrated in the oral interview. The ILR scale,
which has five levels of proficiency rating from 0 ("no functional
language ability") to 5 (proficiency equivalent to that of an "educated
native speaker"), is indicated on the left side of the figure. The
ACTFL/ETS scale, which is an expansion of the lower end of the ILR
scale, is indicated on the right side of the figure. Levels are identified
by descriptors such as "S-1 (Speaking)" in the ILR scale, which
corresponds to "Intermediate" in the ACTFL/ETS scale. Each level and
subdivision within levels (Low, Mid, High) is defined by a descriptive
paragraph. These descriptive paragraphs comprise the rating scale. The
complete ILR and ACTFL/ETS rating scales are found in the
Appendices.
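
As an illustration only, the correspondence between the two scales can
be written as a lookup table. This handbook states the pairings of S-1
with Intermediate, 1+ with Intermediate High, 2 with Advanced, and 3
through 5 with Superior. The remaining rows, and the names ILR_TO_ACTFL
and actfl_labels, are assumptions added for this sketch and should be
checked against the complete scales in the Appendices.

```python
# Hypothetical lookup table: ILR speaking rating -> ACTFL/ETS label(s).
# Rows for 0, 0+, the Low/Mid split at 1, and 2+ are assumptions of this
# sketch; verify them against the full scales before relying on them.
ILR_TO_ACTFL = {
    "0": ["Novice-Low", "Novice-Mid"],
    "0+": ["Novice-High"],
    "1": ["Intermediate-Low", "Intermediate-Mid"],
    "1+": ["Intermediate-High"],
    "2": ["Advanced"],
    "2+": ["Advanced Plus"],
    "3": ["Superior"],
    "3+": ["Superior"],
    "4": ["Superior"],
    "4+": ["Superior"],
    "5": ["Superior"],
}


def actfl_labels(ilr_rating: str) -> list[str]:
    """Return the ACTFL/ETS label(s) that cover an ILR speaking rating.

    Accepts either a bare rating ("1+") or the "S-" form ("S-1").
    Raises KeyError for a rating that is not on the ILR scale.
    """
    key = ilr_rating.strip().upper().removeprefix("S-")
    return ILR_TO_ACTFL[key]
```

The table makes the expansion visible: ILR ratings from 0 through 2+
spread across eight ACTFL/ETS labels, while everything from 3 upward
collapses into the single label Superior.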

Based on research and experience with the ILR scale, ACTFL and
ETS expanded the lower end of that scale (0, "no functional ability,"
to 2, "limited working proficiency") to achieve a more accurate measure
of discrimination in the ranges most commonly achieved by students
in classroom learning situations. Thus, the ACTFL/ETS rating scale
consists of ranges of increasingly larger increments on a scale of
Novice-Low to Superior (0 to 3 on the ILR scale). Each level and its related
subdivisions are characterized by a single-sentence identifier that is
followed by a descriptive paragraph. For example, a "Novice-Low" is



Unable to function in the spoken language. Oral production is limited
to occasional isolated words. Essentially no communicative ability.

[Figure I: Comparison of the ILR and ACTFL/ETS rating scales]

An "Advanced Plus," on the other hand, is

Able to satisfy most work requirements and show ability to
communicate on concrete topics relating to particular interests and special
fields of competence. Often shows remarkable fluency and ease of
speech, but under tension or pressure language may break down.
Weaknesses or unevenness in one of the foregoing or in pronunciation
result in occasional miscommunication. Areas of weakness range
from simple constructions such as plurals, articles, prepositions, and
negatives to more complex structures such as tense usage, passive
constructions, word order, and relative clauses. Normally controls
general vocabulary with some groping for everyday vocabulary still
evident.

At each level of the scale, function, context, and accuracy
requirements comprise the description of proficiency. Function refers to
the tasks accomplished, attitudes expressed, and tone conveyed by the
student. Context refers to the topics, subject areas, activities, and jobs
the student is able to address. Accuracy refers to the acceptability,
quality, and accuracy of the message conveyed by the student. Each
interview is rated by taking into account all three of these areas. The
Functional Trisection (see Figure II) provides a succinct summary of the levels
of the scale.

A rating is assigned by determining the highest level of sustained
performance as evidenced by patterns of strengths and weaknesses. It
should be noted that two different speakers may vary in the way they
demonstrate content, function, and accuracy, but still fall in the same
level of proficiency. Furthermore, the levels of the scale represent ranges
of increasing magnitude rather than equidistant points on a fixed line.
Thus, as one progresses up the scale, the quantity and quality of
language increase geometrically, not arithmetically. This progression
is graphically illustrated in the cone shape of Figure I, where each level
is represented by increasingly larger segments.
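
The rating rule can be sketched in code under strong simplifying
assumptions. In this sketch a level counts as sustained only when
function, context, and accuracy are all judged adequate, and the rating
is the highest level for which that holds with no gap below it. The
names Evidence and assign_rating, the four main level names without
subdivisions, and the reduction of a tester's judgment to three yes/no
flags are all hypothetical. Actual rating depends on a trained tester's
reading of patterns of strengths and weaknesses.

```python
from dataclasses import dataclass

# Main ACTFL/ETS levels, lowest to highest (subdivisions omitted here).
LEVELS = ["Novice", "Intermediate", "Advanced", "Superior"]


@dataclass
class Evidence:
    """Hypothetical summary of one level's performance in an interview."""
    function: bool  # tasks accomplished at this level
    context: bool   # topics and subject areas handled at this level
    accuracy: bool  # message acceptable and accurate at this level

    def sustained(self) -> bool:
        # All three areas of the trisection must hold together.
        return self.function and self.context and self.accuracy


def assign_rating(evidence: dict[str, Evidence]) -> str:
    """Return the highest level sustained with no unsustained level below."""
    rating = LEVELS[0]  # Novice is the bottom of the scale
    for level in LEVELS[1:]:
        observed = evidence.get(level)
        if observed is None or not observed.sustained():
            break  # first breakdown ends the climb
        rating = level
    return rating
```

A speaker who handles Advanced functions and topics but without the
required accuracy would be rated Intermediate in this sketch, because
Advanced performance was attempted but not sustained.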

Rating is normally done from a tape after the interview has been
conducted. This removes the tester from the pressure of simultaneous
interviewing and rating. It also allows for review of the interview as
many times as necessary and with some psychological distance.
Furthermore, the rating can be verified by other testers.
Figure II

FUNCTIONAL TRISECTION OF ORAL PROFICIENCY LEVELS

Columns:
  Function: Tasks accomplished, attitudes expressed, tone conveyed
  Context:  Topics, subject areas, activities and jobs addressed
  Accuracy: Acceptability, quality and accuracy of message conveyed

Level 5 (Superior)
  Function: Functions equivalent to an educated native speaker (ENS).
  Context:  All subjects.
  Accuracy: Performance equivalent to an ENS.

Level 4 (Superior)
  Function: Able to tailor language to fit audience, counsel, persuade,
            negotiate, represent a point of view and interpret for
            dignitaries.
  Context:  All topics normally pertinent to professional needs.
  Accuracy: Nearly equivalent to an ENS. Speech is extensive, precise,
            appropriate to every occasion with only occasional errors.

Level 3 (Superior)
  Function: Can converse in formal and informal situations, resolve
            problem situations, deal with unfamiliar topics, provide
            explanations, describe in detail, offer supported opinions,
            and hypothesize.
  Context:  Practical, social, professional and abstract topics,
            particular interests, and special fields of competence.
  Accuracy: Errors never interfere with understanding and rarely
            disturb the native speaker. Only sporadic errors in basic
            structure.

Level 2 (Advanced)
  Function: Able to fully participate in casual conversations, can
            express facts, give instructions, describe, report, and
            provide narration about current, past and future activities.
  Context:  Concrete topics such as own background, family, interests,
            work, travel, and current events.
  Accuracy: Understandable to native speaker not used to dealing with
            foreigners; sometimes miscommunicates.

Level 1 (Intermediate)
  Function: Can create with the language, ask and answer questions,
            participate in short conversations.
  Context:  Everyday survival topics and courtesy requirements.
  Accuracy: Intelligible to native speaker used to dealing with
            foreigners.

Level 0*
  Function: No functional ability.
  Context:  None.
  Accuracy: Unintelligible.

*Novice Level is not discussed here.
Used with permission of Pardee Lowe, Jr.
HOW IS THE INTERVIEW CONDUCTED?

The interview is a structured conversation that enables the tester to
determine what the student can do and what the student cannot do when
speaking the target language. Since the object of the interview is the
assessment of language proficiency, a ratable sample must be obtained
from the speaker. It follows, therefore, that each question posed by
the tester must have a purpose. Structured questioning not only shortens
the length of the interview, but also improves the sample, thus
allowing the tester to evaluate more accurately.

To ensure that questioning is structured, the interview is divided
into four phases: warm-up, level check, probes, and wind-down. Such
a structure avoids purposeless questioning that
lengthens the interview and generally results in an unratable sample.

The warm-up is designed to put the speaker at ease and, if necessary,
to reacquaint him/her with the target language. It consists of social
amenities and very simple conversation, including salutations and
introductions. The warm-up is concluded when the interviewer has
obtained a preliminary indication of the student's level and when the
student seems relaxed and comfortable with the testing situation.

The level check provides information regarding what the speaker can
do. It is designed to find the highest level at which the student can
sustain a speaking performance. Its purpose is to verify the estimate of
level obtained at the end of the warm-up. During this stage the tester 
checks a number of topics for content and vocabulary, listens for fluen- 
cy, and assesses pronunciation and grammatical accuracy. When the 
speaker is able to perform the functions of the estimated level with 
suitable accuracy and content, the solid floor of successful performance 
has been established. This floor represents what the speaker can do 
consistently. 

Probes provide information regarding what the speaker cannot do. 
In contrast to "level checks," which reveal what a speaker can do
consistently, thus determining the floor of the performance, "probes" are
used to find the ceiling for the rating. The probe stage takes the stu- 
dent above the sustained level several times in different ways, explor- 
ing a variety of contexts as well as a variety of functions. The objective 
is to test the speaker to the highest level of proficiency. When the 
speaker can no longer communicate effectively, this objective has been 
realized and the testing part of the interview is complete. 

The wind-down is designed to return the speaker to the level at which 
he/she functions most accurately. This is done to provide the speaker
with a feeling of accomplishment at the close of the interview. The tester
often returns briefly to a topic discussed previously and on which the 
speaker had been able to converse with a considerable degree of con- 
fidence and ease. The interview generally ends, as it began, with the 
appropriate social amenities. 
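
The four phases can be sketched as a simple procedure, assuming a much
reduced model of the interview. Here the warm-up yields a starting
estimate, the level check moves down until a sustained level (the floor)
is found, the probes move up until the speaker breaks down (the
ceiling), and the wind-down returns to the floor. The function name
run_interview, the can_sustain callback, and the use of only the four
main level names are hypothetical conveniences for this sketch, not
part of the ACTFL/ETS procedure itself.

```python
from typing import Callable, List, Optional, Tuple

# Main ACTFL/ETS levels, lowest to highest (subdivisions omitted here).
LEVELS = ["Novice", "Intermediate", "Advanced", "Superior"]


def run_interview(
    can_sustain: Callable[[str], bool],
    warmup_estimate: str,
) -> Tuple[str, Optional[str], List[str]]:
    """Return (floor, ceiling, log) for one simulated interview.

    can_sustain(level) stands in for the tester's judgment that the
    speaker performs consistently at that level.
    """
    log = ["warm-up"]
    idx = LEVELS.index(warmup_estimate)

    # Level check: verify the warm-up estimate, stepping down if needed.
    while idx > 0 and not can_sustain(LEVELS[idx]):
        log.append(f"level check: {LEVELS[idx]} not sustained")
        idx -= 1
    log.append(f"level check: floor at {LEVELS[idx]}")

    # Probes: push above the floor until communication breaks down.
    ceiling = None
    while idx + 1 < len(LEVELS):
        higher = LEVELS[idx + 1]
        if can_sustain(higher):
            idx += 1
            log.append(f"probe: {higher} sustained, floor raised")
        else:
            ceiling = higher
            log.append(f"probe: breakdown at {higher}")
            break

    # Wind-down: return to the level the speaker handles comfortably.
    log.append(f"wind-down at {LEVELS[idx]}")
    return LEVELS[idx], ceiling, log
```

In a real interview the probe stage takes the student above the
sustained level several times and in different ways, so a single
failed check, as modeled here, would not settle the ceiling.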

During the course of the interview, testers are faced with the task 
of generating questions appropriate to the various stages of the
interview and, concurrently, appropriate to the student's apparent level of
language proficiency. They often prepare types of questions suitable
to each level before the interview. The actual questions, however, are
developed as the interview progresses, according to the information that
the tester is gathering about the speaker.

In addition to following a question/answer pattern, speakers are
asked to role-play situations with the interviewer for levels above Novice.
These role-plays are designed to give the speaker an opportunity to
demonstrate how well he/she can get into, through, and out of a
situation that one might normally encounter when using the foreign
language. They require that the speaker ask questions, sustain a
conversation and, at higher levels, deal with complications.



HOW CAN ONE LEARN TO GIVE THIS TEST? 

The interviewer is not a teacher but rather an evaluator. The two
roles must be kept separate when conducting interviews. Although many
foreign language professors have had training in pedagogy and may
possess a native, or near-native, command of the language they teach,
it does not follow that they can automatically conduct an oral
proficiency interview. Careful training and a substantial number of
practice interviews are necessary before even the best teacher can elicit a
ratable sample from a speaker and subsequently rate the sample
accurately. Such training is available in workshops conducted by
individuals trained by ACTFL and ETS.

An introduction to the principles and procedures of the ACTFL/ETS 
oral proficiency interview and rating scale can be obtained by atten- 
ding a familiarization workshop of one or two days duration. Participa- 
tion in an intensive workshop of four or five days duration, followed 
by post-workshop training (in which participants conduct practice in- 
terviews that are critiqued by their trainers), is a highly recommended 
precondition for certification as an interviewer and rater. For infor- 
mation regarding workshops and tester certification, consult the sources 
mentioned in the Preface of this handbook. 
III. USES OF THE ACTFL/ETS ORAL PROFICIENCY
EVALUATION PROCESS

The ACTFL/ETS oral proficiency interview is a modification of the
Interagency Language Roundtable interview process that reflects
reasonable expectations for proficiency development in academic
settings. By expanding the lower end of the ILR scale (0-3), ACTFL and
ETS have provided a more accurate measure of discrimination in the
ranges most commonly achieved by students in classroom learning
situations. Since the ILR and ACTFL/ETS processes and rating scales are
mutually compatible, they are appropriate for use in a variety of
settings and for a variety of functions.

HOW IS THE PROCESS BEING USED 
IN ACADEMIC SITUATIONS? 

The techniques and principles underlying the oral proficiency
interview serve a broad range of interests for those involved in language
instruction in colleges and universities. Some of the most common
applications, as reported in the March 1984 issue of POPT, the
newsletter of PROJECT OPT, are:

1. Placement in foreign language courses: Appropriate placement
of incoming students in the elementary, intermediate, and advanced
level courses is necessary for both the individual and the class in
order to provide optimal learning conditions. Many institutions
administer a written placement examination to determine if students have
the necessary command of vocabulary and structure to enroll in specific
courses. In cases where oral proficiency is an important factor, such
as conversation courses, or where the score on the written test falls in
the overlapping range of more than one possible course, the process
is useful to suggest which course will be most beneficial. Currently it
is being used for this purpose at Bates College (Maine), Norwich
University (Vermont), and Cabrillo College (California).

2. Evaluation of oral skills in specific foreign language courses: Some
professors use the process at the beginning of the semester in
intermediate and advanced courses to determine entry-level proficiency
of their students. There are two benefits to such utilization. The
interview provides an objective measure of what students can do prior to
receiving further instruction. The resultant evaluation can be used for
diagnostic purposes since the professor has obtained a valid assessment
of strengths and weaknesses. The preliminary interview can also be
compared with one conducted at the end of the course. It is important to
note, however, that the levels of the ACTFL/ETS scale do not parallel
course levels and thus a change in proficiency level may not be a
reasonable expectation. Furthermore, the interview provides the
opportunity for professor and student to become acquainted in a less formal
environment than in the classroom. This personal contact at the
beginning of the course lays the groundwork for developing an in-class
atmosphere that fosters real communicative use of the foreign language.

In addition to entry-level assessment, the oral interview is also used
as part of the criteria for determining a final grade in specific courses.
It is important, however, to recognize that many factors, including
achievement-based measures, determine grades. As stated above, the
levels of the ACTFL/ETS scale do not parallel course levels. Neither
do levels of proficiency equal course grades. Examples of this type of
application can be found in a German advanced composition and
conversation course at Amherst College (Massachusetts), in elementary and
intermediate French courses at the University of Illinois
(Urbana-Champaign), in elementary French courses at the University of
Wisconsin (Madison), and in Russian courses at the University of Vermont.

3. Selection of students for a study-abroad program and
determination of the effects of such a program: At Central College (Pella, IA),
students are required to reach a strong 1+ or 2 on the ILR scale
(Intermediate High or Advanced on the ACTFL/ETS scale) for
acceptance into the JYA program in Granada, Spain. At Keene State
College (New Hampshire), students of French, German, and Spanish are
tested before and after the study abroad program. Russian students
attending the University of New Hampshire Associated Academic
Programs in Leningrad are tested, for placement purposes, at the
beginning of the summer program and then again at the end to determine
progress in proficiency development. Likewise, Russian students enrolled
in programs sponsored by the American Council of Teachers of
Russian at the Pushkin Institute in Moscow are tested and receive an oral
proficiency rating both prior to and after attending the semester and
year courses.

4. Fulfillment of specific program requirements: With the
increasing demand for accountability in education, it is important that foreign
language departments be able to verify that students have completed
such requirements as:

a. The foreign language requirement: In most institutions that have 
a foreign language requirement, fulfillment of that requirement is 
achieved by completing a specific course or series of courses. This ap- 
proach leaves decisions regarding content, skill development, and level 
of achievement up to the department, or in some cases, to individual 
professors. It does not necessarily focus on what students should be
able to do in the language once they have completed the necessary
course credits, nor does it provide a means of comparing what students
at different institutions can do. The inclusion of an oral proficiency
interview in the requirement criteria, such as at the University of
Pennsylvania, where students are tested in French, German, Arabic, and
Russian, assures that the oral skills have been evaluated in a way that 
has validity and meaning beyond the individual course or institution. 

b. Requirements for successful completion of the foreign language
major: Similar to the use of the oral proficiency evaluation process for
non-majors fulfilling the language requirement is its application in
dealing with majors and minors. Oral proficiency is a critical factor both in
successful completion of upper-division courses and in post-graduation
career uses of the foreign language; therefore it must be evaluated in
an objective manner. The interview provides a mechanism for
assessing proficiency without impinging on the departmental prerogative for
determining what students should study to complete the major. It also
provides a standard that can be interpreted by prospective employers
and graduate schools.

c. Requirements for teacher education programs: The U.S.
educational system has been the focus of over two dozen important reports
in the past three years. A recurring theme has been the need for more
competent teachers in all disciplines. One significant reaction to the
often critical remarks regarding teacher preparation is the mandating
by state legislatures of competency examinations for teacher education
students. In foreign languages, several states have developed written
proficiency examinations or are requiring existing standardized national
tests such as the National Teachers Examination. A pilot project in
Texas, funded by the U.S. Department of Education and conducted
under the auspices of the Texas Education Agency, the Texas Foreign
Language Association, and ACTFL, is investigating what oral
proficiency levels are appropriate for beginning teachers. By 1986 it is hoped
that an oral proficiency requirement will be part of the foreign language
teacher certification process. At Central College (Pella, IA), teacher
education students must achieve a rating of 3 on the ILR scale,
equivalent to a "Superior" on the ACTFL/ETS scale.

As the specific academic applications described above illustrate, the 
Oral Proficiency Interview is being used for several purposes in foreign 
language departments in colleges and universities. Familiarization with
the ACTFL/ETS scale and proficiency guidelines is providing foreign
language educators with a better sense of specific needs of students at
the outset of each semester, with a more defined goal to guide their
course design during the semester, and with a markedly more delineated
and standardized method of evaluation at important checkpoints in their
programs.

WHAT WIDER APPLICATIONS RELATE TO THE TEACHING
MISSION OF FOREIGN LANGUAGE DEPARTMENTS?

Because the ACTFL/ETS oral proficiency evaluation process is based
on the same principles and procedures as the Interagency Language
Roundtable interview and scale, the potential exists for extensive use
outside of academic situations. Ratings that are determined by certified
evaluators can become part of the student's transcript and/or dossier
to indicate the level of proficiency achieved in the foreign language.
These ratings, which are defined in terms of the content, function, and
accuracy of the person's language skills, provide an objective and
standardized appraisal of how well that person can communicate. They
convey clearly to employers in industry, commerce, social services, travel
and entertainment, government, and other career fields requiring a
functional knowledge of a foreign language just what a student applying
for a job can do.

It cannot be too strongly emphasized that such extra-academic uses
of the oral proficiency evaluation process are only valid when the
interview and rating are done by a certified tester. The process, while
seemingly simple to comprehend, requires extensive training and practice
in all phases of interviewing and rating. To preserve the integrity of
the system and the ratings, individuals and institutions must resist the
temptation to use ratings that have not been obtained by a certified
evaluator on official documents that become part of a person's academic
file. Procedures have recently been developed by ACTFL, in
conjunction with ETS and ILR, to grant official tester certification, which is
renewable at specified intervals, upon completion of appropriate
training in interviewing and rating.

IV. IMPLICATIONS OF THE ORAL PROFICIENCY
EVALUATION PROCESS

Implicit in the implementation of a precise and standardized method
of evaluation of oral skills is a wide-ranging effect on the future of
foreign language programs in the United States. It is expected that once
the oral proficiency interview is adopted on a broad scale by both
secondary and higher education institutions across the country, it will have
substantial impact on foreign language curricula as a whole, on specific
course goals and objectives, and on the individual teaching and testing
techniques of language professors. The implications of this impact are
discussed below.

WHAT ARE THE IMPLICATIONS
FOR CURRICULUM DEVELOPMENT?

Curriculum development concerns the definition of goals and
objectives of the foreign language program. It focuses on what students
will be expected to know (content) and do (skills) as a result of the
instruction they receive in the language, literature, and culture and
civilization courses that constitute the departmental offerings. Use of oral
proficiency evaluation as an organizing principle for curriculum is likely
to have the following implications:

1. The teaching mission of foreign language faculties can be
reexamined and subsequently redefined in accord with a proficiency-oriented
approach to language instruction. If courses are taught with a focus on
building proficiency, then students can be expected to demonstrate
functional use of the language in addition to completing course
requirements.

2. Traditional level expectations, i.e., elementary, intermediate,
advanced, can be redefined in accordance with empirically derived data,
provided by the oral proficiency interview, relative to student ability
to communicate orally. The definitions of the proficiency levels of the
ACTFL/ETS scale can provide a reasonable structure of expectations
for skill development. Thus, articulation between courses and levels
can be smoother since expectations will be more clearly defined.

3. An attempt can be made to achieve more standardized proficiency
expectations for graduating seniors and pre-service teacher candidates
across the nation. Such standardization would ensure more uniform
performance criteria in use of the foreign language but would leave
specific content decisions to the individual departments.

WHAT ARE THE IMPLICATIONS FOR SPECIFIC COURSES? 

The curriculum of the foreign language department is realized and 
implemented through the specific courses that make up the program. 
The oral proficiency evaluation process is likely to have the following
implications within specific courses:

1. Courses within each level, i.e., elementary, intermediate, and
advanced, can be redesigned, using the level descriptions of the ACTFL
Proficiency Guidelines (the rating scale), to include course-specific
mastery levels. Such an application has already been implemented in
an advanced French conversation course at Northwestern University
in which course goals included demonstrating the ability to "narrate
and describe in the past, present, and future." These goals correspond
to the Advanced level of the ACTFL/ETS scale.

2. Courses designed specifically as a means of augmenting
conversational skills can play a more prominent role in the curriculum.
In addition, such courses can be redesigned, based on a common
standard of proficiency, with the aim of defining more realistic
expectations with regard to what a student can actually say in the foreign
language.

3. Oral achievement tests, patterned on the oral proficiency
interview and based on course-specific material, can become a valuable tool
in evaluating student ability to communicate. By administering more
oral tests in their courses, professors will convey an important message
to students. That message is that the department and the professor take
seriously their responsibility to help students develop real proficiency
in the foreign language.



WHAT ARE THE IMPLICATIONS
FOR TEACHING TECHNIQUES?

During the past 40 years, techniques for teaching foreign languages
have increasingly emphasized the oral and communicative aspects of
classroom learning. As a result of research findings that demonstrate
a positive correlation between meaningful activities and amount of
language learning, foreign language teachers have realized the
importance of creating a classroom environment in which students speak to
each other about topics that are significant to the individual and to the
group. The oral proficiency interview, which is itself a structured
conversation in which the speaker is led to increasingly higher levels of
linguistic and discourse function, has the following implications for
instructional techniques:

1. The use of the target language at all levels of instruction can play
a more prominent role in the instructional process. Professors whose
goals include teaching for proficiency realize the importance of
providing meaningful input in the target language. They communicate with
their students in the target language, at a level that challenges but does
not frustrate, on topics appropriate to the learning tasks. They also
are aware of the fact that, given the appropriate input, skill in
listening comprehension develops more rapidly than skill in speaking and
therefore do not insist on extremely high levels of accuracy when
students are attempting to express their ideas and opinions.

2. To promote meaningful discourse among students, activities in
pairs or small groups can replace the more typical student-teacher
exchanges that have characterized more traditional language-learning
approaches. In such activities, students assume responsibility for initiating
and maintaining conversation in addition to the typical pattern of
responding to direct questions usually posed by the professor.

3. "Situations," such as those used in a structured oral proficiency
interview, can be an effective method for developing a conversation
similar to one the students might experience in the foreign country. Since
these role-play situations reflect increasingly more complicated instances
of typical daily encounters, they require use of the language at
increasingly complex levels of function, content, and accuracy. In addition
to gaining practice in meeting the linguistic challenges of such
encounters, students experience a sense of satisfaction and
accomplishment when they "get into, through, and out of" a simple survival
situation or one with a complication. These feelings help to build confidence
and motivation to continue the process of learning the foreign language.



WHAT ARE THE IMPLICATIONS FOR TESTING
THE OTHER SKILLS?

The oral proficiency interview and rating scale were designed initially 
to evaluate speaking primarily and, by virtue of its utility in the
process, listening comprehension secondarily. The development of the
ACTFL Proficiency Guidelines in the skills of listening, reading, and 
writing has already generated interest and activity in the evaluation of 
proficiency in those areas. The Symposium on Receptive Language 
Skills, conducted by ACTFL in November, 1983, and reported in the 
September, 1984 issue of Foreign Language Annals, is evidence of the 
relevance of the concept of proficiency-based evaluation for all the 
skills. Some of the important implications include: 

1. Testing in the other skill areas, as well as in speaking, will be
focused on functional language use in order to determine what students can
understand, read, and write in circumstances that reflect real language
for communication use. Such testing is more global than specific in
nature and is designed to yield ratings that correspond to those in the
oral proficiency rating scale.

2. Such testing will provide further evidence that students progress
at different rates in the acquisition of different skills. For example, a
typical total proficiency profile might indicate that a student is an
"Intermediate-High" in listening and reading comprehension, but an
"Intermediate-Mid" in writing and possibly an "Intermediate-Low"
or even a "Novice-High" in speaking. These ratings should not
surprise professors since they reflect, to some extent, a natural order of
language acquisition and, perhaps to a greater extent, typical priorities
in classroom language learning. Professors who want to improve their
students' ratings in certain skill areas such as speaking, in particular,
can subsequently make changes in the materials and instructional
techniques they use to help students achieve the desired goals.



V. PROJECT OPT 

WHAT IS PROJECT OPT? 

PROJECT OPT is a research and training project on oral proficiency
testing in college-level foreign language courses conducted at the
University of New Hampshire under a grant from the U.S. Department of
Education in 1982-84. It was designed to achieve three objectives:

1. to determine the feasibility of incorporating specific techniques 
of oral proficiency evaluation in foreign language courses at the col- 
lege level; 

2. to train professors to use the ACTFL/ETS oral proficiency
interview procedures;

3. to create a network in northern New England colleges and
universities of professors interested in oral proficiency evaluation.

The project began with an intensive four-day training workshop in
which 24 professors of French, German, Russian, and Spanish were
trained to use the ACTFL/ETS oral proficiency interview and rating
scale. Upon returning to their home campuses, the workshop
participants conducted 24 interviews, a sample of which was critiqued by
experienced evaluators from such institutions as the Educational Testing
Service, the Defense Language Institute, and Georgetown University.
At a follow-up workshop six months later, participants met with the
evaluators and addressed specific problem areas. Throughout the
training period, data, which were subsequently used in the analysis reported
below, were collected from the participants.

Nine research questions associated with the principal objective of
determining the feasibility of incorporating specific techniques of oral
proficiency evaluation in foreign language courses at the college level
were investigated. These questions are representative of what faculty
members often ask about the procedures and results of oral
proficiency evaluation. A brief summary of the answers to these questions
follows.

WHAT ANSWERS DID PROJECT OPT PROVIDE TO
QUESTIONS ABOUT ORAL PROFICIENCY TESTING
IN COLLEGE-LEVEL FOREIGN LANGUAGE COURSES?

*1. Can professors learn to conduct and rate oral proficiency interviews
effectively after completing an intensive four-day workshop and a
supervised practicum that includes a critique of 24 interviews?

The answer to this question is a qualified yes. When the study
was conducted, a procedure for certifying new testers had not yet
been defined. Thus it was not possible, at that time, to state
definitively if the type and length of training was adequate to lead
to certification of a high percentage of the workshop participants.
However, the evaluators who critiqued the tapes did judge several
criteria related to the participants' performance and potential. They
gave high ratings to 57% for interviewing techniques, 30% for rating
techniques, 70% for potential as proficient interviewers, and 65%
as proficiency raters. A uniform 13% received low ratings in all
categories, with the remaining percentages falling in the average
category.
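The "remaining percentages" in the average category are not stated but can be derived from the figures above. The following is a minimal sketch in Python, assuming each category sums to 100% and that the three bands (high, average, low) are exhaustive; the variable and function names are illustrative only and do not come from the study.

```python
# Shares of participants rated "high" by the evaluators, as reported above.
high = {
    "interviewing techniques": 57,
    "rating techniques": 30,
    "potential as interviewers": 70,
    "potential as raters": 65,
}
LOW = 13  # the uniform "low" share reported for every category


def average_share(high_pct, low_pct=LOW):
    """Percentage left for the 'average' band, assuming the bands sum to 100."""
    return 100 - high_pct - low_pct


average = {category: average_share(pct) for category, pct in high.items()}

for category, pct in average.items():
    print(f"{category}: {pct}% average")
```

Under these assumptions, the average band is largest for rating techniques, which is consistent with the observation that rating is more problematic than interviewing.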

Two conclusions can be drawn from the data. In the first place,
it is apparent that the intensive training and the interviewing
practice of at least 24 interviews are minimal requirements for tester
certification. In fact, most experienced testers agree that many more
interviews are necessary before one becomes proficient and
comfortable. In the second place, it is obvious that rating is more
problematic than interviewing. The distinctions within and across levels
require a thorough understanding of the scale that only comes with
experience in using it.

Since a tester certification procedure now exists and is being
implemented by ACTFL/ETS, a more definitive answer to this
question will be possible. Participants who complete the initial training
and practicum now submit tapes and ratings to ACTFL for
validation. As a result of this process, data can be gathered to determine
the rate of success as well as the kind of supplementary training
needed.

*2. What demands, in terms of time needed for preparation,
administration, and scoring, does use of the oral proficiency interview make upon
professors?

The amount of time required to conduct an oral proficiency
interview, which is of necessity an individual process, is a matter of
legitimate concern. The data revealed that beginning interviewers
spent an average of 56 minutes arranging for, conducting, and then
reviewing the taped interview to rate it. Almost a quarter of this time
was devoted to obtaining the specific type of interviewees needed
for the project, with the remaining three-quarters spent in
interviewing and rating.

As testers gain experience in choosing appropriate questioning
techniques, the amount of time needed to obtain a ratable sample
is reduced considerably. While there are no set time limits, a
Novice-level interview generally lasts from 5-7 minutes, an Intermediate-level
about 10-12 minutes, and an Advanced-level or Superior-level from
15-20 minutes at the most. Taping the interview provides a record
that can be used for delayed rating or to check a rating determined
immediately upon completion of the interview.

*3. What special conditions, facilities, and equipment do professors
need to use the oral proficiency interview?

Participants did not indicate a need for special conditions,
facilities, and equipment, i.e., those not normally available in the
department, for conducting the interviews. Three-quarters of the
participants used their own offices to conduct the interviews while the
rest used the language laboratory. Portable tape recorders proved
adequate though some professors reported problems with
background noise and low-quality tapes. Students did not appear
to be negatively affected by the presence of the tape recorder. It is
important to make sure that there are no telephone calls or similar
interruptions during the course of the interview.

*4. What impact does the oral proficiency interview have upon the
curriculum, the instructional materials, and the procedures used to achieve
the goals of the course?

The primary impact was perceived to be more emphasis on
developing communicative skills by using proficiency-oriented texts,
task-oriented instructional activities (especially role-play
"situations") and authentic recorded materials in class. (For a more
complete discussion of specific applications, see Section IV.)

*5. What presage and context variables are associated with successful
use of the oral proficiency interview?

This question was designed to determine what relationships, if
any, exist between (1) teacher characteristics, professional training,
and experience, and (2) success in interviewing and rating. Because
of the small sample size in each of the four languages represented
in the study (French, 8; German, 4; Russian, 4; and Spanish, 9) the
analysis was not carried out. Experience has shown, however, that
interviewers should themselves be at Level 3, and preferably 3+,
on the ILR scale in order to obtain a ratable sample and to make
accurate rating decisions along the full range of the ACTFL/ETS
rating scale in academic situations.



*6. What problems do professors encounter when using the oral profi- 
ciency interview in a foreign language course? 

The three most frequently mentioned problems concern (1)
assuming the role of tester, especially in the interview; (2) using probes
appropriately in the interview; and (3) assigning an appropriate
rating, especially in the Intermediate range.

(1) There are significant differences between the role of teacher
and the role of tester. In the classroom, language teachers use several
typical behaviors to encourage and reinforce learning. These include
cueing responses, prompting students who hesitate, repeating
student responses for correction or reinforcement purposes, and
reacting with evaluative comments, especially when students respond
correctly. In the interview, language testers must avoid these kinds of
behaviors for a very specific reason: obtaining a ratable sample of
the student's language proficiency depends upon an objective
approach to interviewing that requires that the student alone show what
he/she can and cannot do. Prompting, repeating, and encouraging
by means of judgmental feedback on the part of the interviewer do
not allow the student to do this. Thus, teacher-testers must learn
to wait for answers without prompting, accept what students say
without correcting or evaluating what is said, and permit students
to reach the point where they can no longer express themselves
comfortably. They must do these things, however, in a non-threatening
manner that does not intimidate. Teacher-testers report that it is most
difficult to accomplish this when one interviews one's own students
(see 8 below).

(2) In the interview, probes are used to find out whether or not
the speaker can go beyond the estimated level of proficiency. They
serve to establish the "ceiling" or highest level at which the speaker
can function consistently by taking the speaker to "linguistic
breakdown," a point at which he/she can no longer convey
meaning effectively. An interview should include at least three probes in
order to show conclusively that the speaker cannot sustain speech
at a higher level. Beginning testers often feel awkward putting
speakers in such an uncomfortable position once, let alone three times
during an interview. They also are not sure of the appropriateness
of their probes and may skip a level with the result that the speaker
is not tested at a level which he/she might have been able to manage.

(3) Rating the interview requires considerable familiarity with the
ACTFL/ETS rating scale and the distinctions between and within
levels. First, it is extremely important that raters recognize the
characteristics of each of the four base levels (Novice, Intermediate,
Advanced, and Superior) in the system. Errors in rating decisions
across these major borders indicate significant problems with either
the sample or the matching of the sample to the definitions. Then,
it is important that raters understand the different degrees of
proficiency (Low, Mid, High) within the levels. While errors in assigning
these ratings within levels are not as critical as those across levels,
they are a matter of concern and should be corrected with additional
practice. It should be noted, however, that the ACTFL/ETS
definitions are still in a provisional, or developmental, stage, and some
are acknowledged to need revision to eliminate possible ambiguities.

*7. What further training is necessary to help professors test oral pro- 
ficiency in their courses? 

Participants in the study agreed that the training sessions and the
24 critiqued interviews provided them with a basic understanding
of the principles and procedures of the evaluation process. They
acknowledged that increased confidence and accuracy in
interviewing and rating are a function of experience. They recommended more
observation of live or videotaped interviews and additional
training, through shorter follow-up workshops, to determine plus points
in levels. The evaluators who critiqued the participant tapes indicated
a need to provide more training in estimating proficiency during the
interview to make appropriate choices of interviewing techniques
such as probes.

*8. What differences, if any, appear in the interviewing and rating 
techniques of professors when interviewing their own students and 
students whom they do not know? 

A critical question is whether or not professors can obtain valid
and reliable ratings when interviewing their own students. In many
foreign language departments, the number of faculty members
teaching a particular language may be limited to one or two
professors and thus, if the students are to be interviewed and rated, it
must be done by someone who is already familiar with what the
students have done in class. The study provided data regarding the
participants' perceptions of rating their own students and the
evaluators' perceptions.

While acknowledging that interviewing one's own students is more
difficult than interviewing those one does not know, over two-thirds
of the participants in the study believed that interviews with their
own students were as valid and reliable as those with unknown
students. Twice as many felt more comfortable with their own
students than with unknown students (3); however, a similar number
noted that they had to resist slanting the interview toward
known vocabulary and grammar. A third of the participants were
surprised at the differences between the ratings and class performance,
with the usual expectation being a higher rating.

The evaluators who critiqued the participant tapes affirmed that,
regardless of whether or not the professors were interviewing known
or unknown students, 95% of the interviews constituted ratable
samples. This finding indicates that, in the opinion of these
experienced raters, it is possible to interview one's own students in a valid
and a reliable manner by objectively following the guidelines.

When critiquing the participants' performance with regard to
assigning a rating, the evaluators found more to question. Based on
a stratified sample of all the interviews, the evaluators agreed with
39% of the participants' ratings and disagreed with 61%. When they
disagreed, they raised the rating 47% of the time and lowered it 53%
of the time. In 82% of the cases of disagreement, the magnitude
was one step in the rating scale (from a Low to Mid, for example).
These findings indicate that beginning raters are likely to rate high
or low, by one step, but are generally close to the rating assigned
by experienced raters.
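The claim that beginning raters are "generally close" can be checked by combining the reported percentages. The sketch below, in Python, takes the figures at face value and assumes that the 82% applies to all disagreements in the stratified sample; the variable names are illustrative and not part of the study.

```python
# Reported outcome of the evaluators' review of participant ratings.
agree = 0.39                    # evaluators confirmed the participant's rating
disagree = 0.61                 # evaluators changed the rating
one_step_given_disagree = 0.82  # share of disagreements that were one step
raised_given_disagree = 0.47    # share of disagreements raised
lowered_given_disagree = 0.53   # share of disagreements lowered

# Share of all sampled ratings that were off by exactly one step.
one_step_off = disagree * one_step_given_disagree

# Share of all sampled ratings that were exact or within one step.
within_one_step = agree + one_step_off

# Share of all sampled ratings that missed by more than one step.
more_than_one_step = disagree - one_step_off

# Share of all sampled ratings raised or lowered by the evaluators.
raised = disagree * raised_given_disagree
lowered = disagree * lowered_given_disagree

print(f"within one step: {within_one_step:.1%}")
print(f"more than one step: {more_than_one_step:.1%}")
print(f"raised: {raised:.1%}, lowered: {lowered:.1%}")
```

On these assumptions, roughly nine in ten of the sampled ratings were either confirmed or adjusted by a single step.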

When the data were analyzed to determine differences in rating
known and unknown students, the evaluators agreed with the rating
of known students 26% of the time and with unknown students 54%
of the time. Since agreement between experienced and inexperienced
raters was more than twice as likely for unknown students, it appears
that factors associated with previous classroom contact with students
do affect a rating given by beginning testers. It
is interesting to note, however, that when evaluators disagreed with
ratings, they raised them 44% of the time for known students but
only 38% of the time for unknown students. It would seem that
professors might be compensating for their experience with known
students by giving lower ratings.

The statistics support the participant consensus that it is more
difficult for beginning testers to rate their own students than to rate
students who have not studied with them. A reasonable solution to
the dilemma is to have students interviewed by qualified testers, even
if the student has studied with the interviewer, and then have the
tapes rated by independent evaluators. When a larger pool of
certified testers is available, it will be interesting to continue the study
of the validity and reliability of rating one's own students.

*9. What proficiency levels are associated with typical levels of instruc- 
tion (beginning, intermediate, advanced) at the college level? 

Participants in the study interviewed students enrolled in
beginning, intermediate, and advanced language classes and in literature
and culture classes taught in the foreign language. The resultant data
provides some preliminary information on expected levels of
proficiency in each of the typical levels of instruction. It should be noted
that the sample (beginning students = 35, intermediate students =
71, advanced students = 28, literature and culture students = 20)
is small and includes all four languages (French, German, Russian,
and Spanish) of the project. In addition, the ratings were
determined by beginning testers involved in a training project. As discussed
above, experienced evaluators disagreed with these ratings 61% of
the time and lowered the rating 53% of the time. Half of these
disagreements involved major boundary decisions (from Intermediate
to Advanced, for example) and half involved decisions within levels
(from Mid to High, for example).

The proficiency levels in each level of instruction were: 

1. Beginning language courses (1st and 2nd semester)
Spanish (n = 20): Novice-High
Intensive Russian (n = 15): Intermediate-Mid

2. Intermediate language courses (3rd and 4th semester)
French (n = 32): Intermediate-Mid
German (n = 23): Advanced
Russian (n = 15): Intermediate-Mid

3. Advanced language courses (5th and 6th semester)
French (n = 8): Intermediate-High
Spanish (n = 20): Intermediate-Mid

4. Literature and Culture courses
French (n = 8): Intermediate-High
Spanish (n = 12): Intermediate-High

While it may seem natural to equate level of course with level of
proficiency, this tendency must be avoided, especially as the level
of instruction advances. Tentative as the data is, it does reflect an
important characteristic of the ACTFL/ETS scale: the ranges on the
scale are not fixed increments on an absolute scale but rather
increasingly larger segments that require progressively greater
command of function, content, and accuracy of language use. Thus,
while the Beginning Spanish students were rated predictably in the
Novice range, all the other students, with the exception of the
Intermediate German students, were rated in the Intermediate range.
The fact that 48 of these students were enrolled in what, for
instructional purposes, are called "advanced" courses does not mean that
they demonstrate proficiency at the Advanced levels.
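The group sizes behind these observations can be tallied directly from the list of courses above. The following Python sketch is illustrative only; the data structure and names are assumptions made for the example. It shows that the 48 students in "advanced" courses correspond to the advanced language groups plus the literature and culture groups. Note that the intermediate groups as listed sum to 70, whereas the sample description reports 71; the source does not explain the difference.

```python
# (course level, language, number of students) as listed above.
groups = [
    ("beginning", "Spanish", 20),
    ("beginning", "Intensive Russian", 15),
    ("intermediate", "French", 32),
    ("intermediate", "German", 23),
    ("intermediate", "Russian", 15),
    ("advanced", "French", 8),
    ("advanced", "Spanish", 20),
    ("literature and culture", "French", 8),
    ("literature and culture", "Spanish", 12),
]

# Sum the group sizes for each course level.
totals = {}
for level, _language, n in groups:
    totals[level] = totals.get(level, 0) + n

# Students in courses beyond the intermediate level.
upper_level = totals["advanced"] + totals["literature and culture"]

for level, n in totals.items():
    print(f"{level}: {n}")
print(f"advanced plus literature and culture: {upper_level}")
```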

As more testers are certified and more students are evaluated
using the oral proficiency interview, statistics related to the
relationship of course level and proficiency level will be more reliable. Such
statistics will be useful to departments in colleges and universities
in determining how their students compare, in terms of proficiency,
with students at similar levels of instruction in other institutions.



The references listed below constitute a selected bibliography of materials in
print on the topic of the ACTFL/ETS oral proficiency evaluation process. The
a bibliographic catalogue published monthly by the Educational Resources
Information Center (ERIC). RIE catalogues and the Current Index to
Journals in Education (CIJE), also published by ERIC, are available in university
libraries. RIE documents are contained on microfiche and generally found in
university libraries. They also may be ordered from ERIC.

APPENDICES
I. The Interagency Language Roundtable Oral Proficiency Scale



S-0 NO PROFICIENCY

Unable to function in the spoken language. Oral production is limited to occasional
isolated words. Has essentially no communicative ability.

S-0+ MEMORIZED PROFICIENCY 

Able to satisfy immediate needs using rehearsed utterances. Shows little real autonomy
of expression, flexibility, or spontaneity. Can ask questions or make statements with
reasonable accuracy only with memorized utterances or formulae. Attempts at creating
speech are usually unsuccessful.

Examples: The S-0+'s vocabulary is usually limited to areas of immediate survival
needs. Most utterances are telegraphic; that is, functors (linking words, markers, and
the like) are omitted, confused, or distorted. An S-0+ can usually differentiate most
significant sounds when produced in isolation, but, when combined in words or groups
of words, errors may be frequent. Even with repetition, communication is severely limited
even with persons used to dealing with foreigners. Stress, intonation, tone, etc. are usually
quite faulty.

S-1 ELEMENTARY PROFICIENCY (Base Level)

Able to satisfy minimum courtesy requirements and maintain very simple face-to-
face conversations on familiar topics. A native speaker must often use slowed speech,
repetition, paraphrase, or a combination of these to be understood by an S-1. Similarly,
the native speaker must strain and employ real-world knowledge to understand even simple
statements/questions from the S-1. An S-1 speaker has a functional, but limited
proficiency. Misunderstandings are frequent, but the S-1 is able to ask for help and to verify
comprehension of native speech in face-to-face interaction. The S-1 is unable to
produce continuous discourse except with rehearsed material.

Examples: Structural accuracy is likely to be random or severely limited. Time
concepts are vague. Vocabulary is inaccurate, and its range is very narrow. The S-1 often
speaks with great difficulty. By repeating, such speakers can make themselves understood
to native speakers who are in regular contact with foreigners but there is little precision
in the information conveyed. Needs, experience, or training may vary greatly from
individual to individual; for example, S-1s may have encountered quite different vocabulary
areas. However, the S-1 can typically satisfy predictable, simple, personal, and
accommodation needs; can generally meet courtesy, introduction, and identification
requirements; exchange greetings; elicit and provide, for example, predictable and skeletal
biographical information. An S-1 might give information about business hours, explain
routine procedures in a limited way, and state in a simple manner what actions will be
taken. The S-1 is able to formulate some questions even in languages with complicated
question constructions. Almost every utterance may be characterized by structural
errors and errors in basic grammatical relations. Vocabulary is extremely limited and
characteristically does not include modifiers. Pronunciation, stress, and intonation are
generally poor, often heavily influenced by another language. Use of structure and
vocabulary is highly imprecise.

S-1+ ELEMENTARY PROFICIENCY (Higher Level)

Can initiate and maintain predictable face-to-face conversations and satisfy limited
social demands. The S-1+ may, however, have little understanding of the social
conventions of conversation. The interlocutor is generally required to strain and employ
real-world knowledge to understand even some simple speech. An S-1+ may hesitate and
may have to change subjects due to lack of language resources. Range and control of
the language are limited. Speech largely consists of a series of short, discrete utterances.

Examples: An S-1+ is able to satisfy most travel and accommodation needs and a
limited range of social demands beyond exchanges of skeletal biographical information.
Speaking ability may extend beyond immediate survival needs. Accuracy in basic
grammatical relations is evident, although not consistent. May exhibit the commoner forms
of verb tenses, for example, but may make frequent errors in formation and selection.
While some structures are established, errors occur in more complex patterns. The S-1+
typically cannot sustain coherent structures in longer utterances or unfamiliar situations.
Ability to describe and give precise information is limited. Person, space, and time
references are often used incorrectly. Pronunciation is understandable to natives used
to dealing with foreigners. Can combine most significant sounds with reasonable
comprehensibility, but has difficulty in producing certain sounds in certain positions or in
certain combinations. Speech will usually be labored. Frequently has to repeat utterances
to be understood by the general public.

S-2 LIMITED WORKING PROFICIENCY (Base Level) 

Able to satisfy routine social demands and limited work requirements. Can handle
routine work-related interactions that are limited in scope. In more complex and
sophisticated work-related tasks, language usage generally disturbs the native speaker.
Can handle with confidence, but not with facility, most normal, high-frequency social
conversational situations including extensive, but casual conversations about current
events as well as work, family, and autobiographical information. The S-2 can get the
gist of most everyday conversations but has some difficulty understanding native speakers
in situations that require specialized or sophisticated knowledge. The S-2's utterances
are minimally cohesive. Linguistic structure is usually not very elaborate and not
thoroughly controlled; errors are frequent. Vocabulary use is appropriate for
high-frequency utterances, but unusual or imprecise elsewhere.

Examples: While these interactions will vary widely from individual to individual,
an S-2 can typically ask and answer predictable questions in the workplace and give
straightforward instructions to subordinates. Additionally, the S-2 can participate in
personal and accommodation-type interactions with elaboration and facility; that is, can
give and understand complicated, detailed, and extensive directions, and make non-routine
changes in travel and accommodation arrangements. Simple structures and basic
grammatical relations are typically controlled; however, there are areas of weakness. In the
commonly taught languages, these may be simple markings such as plurals, articles, linking
words, and negatives or more complex structures such as tense/aspect usage, case
morphology, passive constructions, word order, and embedding.




S-2+ LIMITED WORKING PROFICIENCY (Higher Level) 

Able to satisfy most work requirements with language usage that is often, but not
always, acceptable and effective. An S-2+ shows considerable ability to communicate
effectively on topics relating to particular interests and special fields of competence. Often
shows a high degree of fluency and ease of speech, yet when under tension or pressure,
the ability to use the language effectively may deteriorate. Comprehension of normal
native speech is typically nearly complete. An S-2+ may miss cultural and local references
and may require a native speaker to adjust to his/her limitations in some ways. Native
speakers often perceive the S-2+'s speech to contain awkward or inaccurate phrasing
of ideas, mistaken time, space, and person references, or to be in some way inappropriate,
if not strictly incorrect.

Examples: Typically an S-2+ can participate in most social, formal, and informal
interactions; but limitations either in range of contexts, types of tasks, or level of
accuracy hinder effectiveness. The S-2+ may be ill at ease with the use of the language
either in social interaction or in speaking at length in professional contexts. An S-2+
is generally strong in either structural precision or vocabulary, but not in both. Weakness
or unevenness in one of the foregoing, or in pronunciation, occasionally results in
miscommunication. Normally controls, but cannot always easily produce general vocabulary.
Discourse is often incohesive.

S-3 GENERAL PROFESSIONAL PROFICIENCY (Base Level)

Able to speak the language with sufficient structural accuracy and vocabulary to
participate effectively in most formal and informal conversations on practical, social, and
professional topics. Nevertheless, an S-3's limitations generally restrict the professional
contexts of language use to matters of shared knowledge and/or international convention.
Discourse is cohesive. An S-3 uses the language acceptably, but with some noticeable
imperfections; yet, errors virtually never interfere with understanding and rarely disturb
the native speaker. An S-3 can effectively combine structure and vocabulary to convey
his/her meaning accurately. An S-3 speaks readily and fills pauses suitably. In face-to-face
conversation with natives speaking the standard dialect at a normal rate of speech,
comprehension is quite complete. Although cultural references, proverbs, and the
implications of nuances and idiom may not be fully understood, the S-3 can easily repair
the conversation. Pronunciation may be obviously foreign. Individual sounds are
accurate; but stress, intonation, and pitch control may be faulty.

Examples: Can typically discuss particular interests and special fields of competence
with reasonable ease. Can use the language as part of normal professional duties such
as answering objections, clarifying points, justifying decisions, understanding the essence
of challenges, stating and defending policy, conducting meetings, delivering briefings,
or other extended and elaborate informative monologues. Can reliably elicit information
and informed opinion from native speakers. Structural inaccuracy is rarely the major
cause of misunderstanding. Use of structural devices is flexible and elaborate. Without
searching for words or phrases, an S-3 uses the language clearly and relatively naturally
to elaborate concepts freely and make ideas easily understandable to native speakers.
Errors occur in low-frequency and highly complex structures.

S-3+ GENERAL PROFESSIONAL PROFICIENCY (Higher Level)

Is often able to use the language to satisfy professional needs in a wide range of
sophisticated and demanding tasks.

Despite obvious strengths, may exhibit some hesitancy, uncertainty, effort, or errors
which limit the range of language-use tasks that can be reliably performed. Typically
there is particular strength in fluency and one or more, but not all, of the
following: has breadth of lexicon, including low- and medium-frequency items, especially
sociolinguistic/cultural references and nuances of close synonyms; employs structural
precision, with sophisticated features that are readily, accurately, and appropriately
controlled (such as complex modification and embedding in Indo-European languages); has
discourse competence in a wide range of contexts and tasks, often matching a native
speaker's strategic and organizational abilities and expectations. Occasional patterned
errors occur in low-frequency and highly complex structures.

S-4 ADVANCED PROFESSIONAL PROFICIENCY (Base Level)

Able to use the language fluently and accurately on all levels normally pertinent to
professional needs. An S-4's language usage and ability to function are fully successful.
Organizes discourse well, employing functional rhetorical speech devices, native cultural
references, and understanding. Language ability only rarely hinders him/her in
performing any task requiring language; yet, an S-4 would seldom be perceived as a native.
Speaks effortlessly and smoothly and is able to use the language with a high degree of
effectiveness, reliability, and precision for all representational purposes within the range
of personal and professional experience and scope of responsibilities. Can serve as an
informal interpreter in a range of unpredictable circumstances. Can perform extensive,
sophisticated language tasks, encompassing most matters of interest to well-educated native
speakers, including tasks which do not bear directly on a professional specialty.

Examples: Can discuss in detail concepts which are fundamentally different from
those of the target culture and make those concepts clear and accessible to the native
speaker. Similarly, an S-4 can understand the details and ramifications of concepts that
are culturally or conceptually different from his/her own. Can set the tone of
interpersonal official, semi-official, and non-professional verbal exchanges with a representative
range of native speakers (in a range of varied audiences, purposes, tasks, and settings).
Can play an effective role among native speakers in such contexts as conferences,
lectures, and debates on matters of disagreement. Can advocate a position at length, both
formally and in chance encounters, using sophisticated verbal strategies. Can understand
and reliably produce shifts of both subject matter and tone. Can understand native
speakers of the standard and other major dialects in essentially any face-to-face interaction.

S-4+ ADVANCED PROFESSIONAL PROFICIENCY (Higher Level)

Speaking proficiency is regularly superior in all respects, usually equivalent to that
of a well-educated, highly articulate native speaker. Language ability does not impede
the performance of any language-use task. However, an S-4+ would not necessarily be
perceived as culturally native.

Examples: An S-4+ organizes discourse well, employing functional rhetorical speech
devices, native cultural references and understanding. Effectively applies a native speaker's
social and circumstantial knowledge. However, cannot sustain that performance under
all circumstances. While an S-4+ has a wide range and control of structure, an
occasional non-native slip may occur. An S-4+ has a sophisticated control of vocabulary
and phrasing that is rarely imprecise, yet there are occasional weaknesses in idioms,
colloquialisms, pronunciation, cultural reference or there may be an occasional failure to
interact in a totally native manner.

S-5 FUNCTIONALLY NATIVE PROFICIENCY

Speaking proficiency is functionally equivalent to that of a highly articulate,
well-educated native speaker and reflects the cultural standards of the country where the
language is natively spoken. An S-5 uses the language with complete flexibility and
intuition, so that speech on all levels is fully accepted by well-educated native speakers
in all of its features, including breadth of vocabulary and idiom, colloquialisms, and
pertinent cultural references. Pronunciation is typically consistent with that of
well-educated native speakers of a non-stigmatized dialect.

2. The ACTFL/ETS Oral Proficiency Scale

Novice-Low

Unable to function in the spoken language. Oral production is limited to occasional
isolated words. Essentially no communicative ability.

Novice-Mid

Able to operate only in a very limited capacity within areas of need.
Vocabulary limited to that necessary to express simple elementary needs and basic courtesy
formulae. Syntax is fragmented, inflections and word endings frequently omitted,
confused, or distorted, and the majority of utterances consist of isolated words or short
formulae. Utterances rarely consist of more than two or three words and are marked
by frequent long pauses and repetition of an interlocutor's words. Pronunciation is
frequently unintelligible and is strongly influenced by first language. Can be understood
only with difficulty, even by persons such as teachers who are used to speaking with
non-native speakers or in interactions where the context strongly supports the utterance.

Novice-High

Able to satisfy immediate needs using learned utterances. Can ask questions or make
statements with reasonable accuracy only where this involves short memorized utterances
or formulae. There is no real autonomy of expression, although there may be some
emerging signs of spontaneity and flexibility. There is a slight increase in utterance length but
frequent long pauses and repetition of interlocutor's words still occur. Most utterances
are telegraphic and word endings are often omitted, confused, or distorted. Vocabulary
is limited to areas of immediate survival needs. Can differentiate most phonemes when
produced in isolation but when they are combined in words or groups of words, errors
are frequent and, even with repetition, may severely inhibit communication even with
persons used to dealing with such learners. Little development in stress and intonation
is evident.

Intermediate-Low

Able to satisfy basic survival needs and minimum courtesy requirements. In areas
of immediate need or on very familiar topics, can ask and answer simple questions,
initiate and respond to simple statements, and maintain very simple face-to-face
conversations. When asked to do so, is able to formulate some questions with limited
constructions and much inaccuracy. Almost every utterance contains fractured syntax and other
grammatical errors. Vocabulary inadequate to express anything but the most
elementary needs. Strong interference from native language occurs in articulation, stress, and
intonation. Misunderstandings frequently arise from limited vocabulary and grammar
and erroneous phonology but, with repetition, can generally be understood by native
speakers in regular contact with foreigners attempting to speak their language. Little
precision in information conveyed owing to tentative state of grammatical development and
little or no use of modifiers.

Intermediate-Mid

Able to satisfy some survival needs and some limited social demands. Is able to
formulate some questions when asked to do so. Vocabulary permits discussion of topics
beyond basic survival needs such as personal history and leisure time activities. Some
evidence of grammatical accuracy in basic constructions, for example, subject-verb
agreement, noun-adjective agreement, some notion of inflection.

Intermediate-High

Able to satisfy most survival needs and limited social demands. Shows some
spontaneity in language production but fluency is very uneven. Can initiate and sustain a
general conversation but has little understanding of the social conventions of
conversation. Developing flexibility in a range of circumstances beyond immediate survival needs.
Limited vocabulary range necessitates much hesitation and circumlocution. The
commoner tense forms occur but errors are frequent in formation and selection. Can use
most question forms. While some word order is established, errors still occur in more
complex patterns. Cannot sustain coherent structures in longer utterances or unfamiliar
situations. Ability to describe and give precise information is limited. Aware of basic
cohesive features such as pronouns and verb inflections, but many are unreliable, especially
if less immediate in reference. Extended discourse is largely a series of short, discrete
utterances. Articulation is comprehensible to native speakers used to dealing with
foreigners, and can combine most phonemes with reasonable comprehensibility, but still
has difficulty in producing certain sounds, in certain positions, or in certain
combinations, and speech will usually be labored. Still has to repeat utterances frequently to be
understood by the general public. Able to produce some narration in either past or future.



Advanced

Able to satisfy routine social demands and limited work requirements. Can handle
with confidence but not with facility most social situations including introductions and
casual conversations about current events, as well as work, family, and autobiographical
information; can handle limited work requirements, needing help in handling any
complications or difficulties. Has a speaking vocabulary sufficient to respond simply with
some circumlocutions; accent, though often quite faulty, is intelligible; can usually
handle elementary constructions quite accurately but does not have thorough or confident
control of the grammar.

Advanced Plus

Able to satisfy most work requirements and show some ability to communicate on
concrete topics relating to particular interests and special fields of competence.
Generally strong in either grammar or vocabulary, but not in both. Weaknesses or unevenness
in one of the foregoing or in pronunciation result in occasional miscommunication. Areas
of weakness range from simple constructions such as plurals, articles, prepositions, and
negatives, to more complex structures such as tense usage, passive constructions, word
order, and relative clauses. Normally controls general vocabulary with some
groping for everyday vocabulary still evident. Often shows remarkable fluency and ease
of speech, but under tension or pressure language may break down.

Superior

Able to speak the language with sufficient structural accuracy and vocabulary to
participate effectively in most formal and informal conversations on practical, social, and
professional topics. Can discuss particular interests and special fields of competence with
reasonable ease. Vocabulary is broad enough that speaker rarely has to grope for a word;
accent may be obviously foreign; control of grammar good; errors virtually never
interfere with understanding and rarely disturb the native speaker.




